Dog Breed Rank Prediction using Random Forest Regressor

Best-Fit a Dog for me?

https://www.kaggle.com/datasets/pranav941/hows-that-dog-for-me

Preprocessing

Visualization

Modelling

Evaluation

R-squared score = 21%, meaning the predictors can explain 21% of the variation in the target variable. This result makes the model far below sufficient to be used in real-life situation and should be improved or scrapped entirely.

Improving the Model

I decide to remove all features below 4% importance.

Evaluation

R-squared score = 24%, meaning the predictors can explain 24% of the variation in the target variable. While the result has improved, it is still far below the ideal value with at least 70%.